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Executive Summary 


The focus of this report is on major improvement and system test. Receiver sensitivity has been significantly im¬ 
proved and about 3 dB time reversal precoding gain has been observed based on system trial in indoor non-line-of- 
sight scenario. Theoretical study shows that additional gain can be achieved by using optimum waveform design. 
Listed below are major achievements during the quarter: 


• multiple integration window sizes; 

• comprehensive system test; 

• improved receiver sensitivity to -81 dBm; 

• 3 to 4 dB time reversal precoding gain in indoor non-line-of-sight scenario. 

We will continue to improve the system and conduct more measurements, while seeking some technology transfer 
opportunities. 
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CHAPTERL MAJOR IMPROVEMENTS 


Our major development work in the last quarter is to test and modify the system. The most significant part is the 
synchronization at the receiver side. 


1.1 Synchronization Improvements 

For a burst mode radio system, the synchronization has always been a challenge. In order to make the synchro¬ 
nization more robust for the test-bed, we have made some improvements, which include modification of the frame 
structure, average samples for start pattern searching and various thresholds adjustment. 


1.1.1 Synchronization Review 

Before talking about these synchronization improvements, let’s first review the receiver’s implementation structure 
and data flow. The functional diagram is shown as Fig. 1.1. The main modules include data interface between FPGA 
and ADC, integration, synchronization, decision, a finite state machine and thresholds control. Among them, the 
challenges at the receiver are mainly on the high speed interface, fast integration and the robust synchronization. We 
are currently using a 3 stage synchronization technique which is shown in Fig. 1.3. 



Figure 1.1: Receiver functional diagram 

Fig. 1.2 is the data flow for the receiver. First the interface module converts the high speed ADC output data to 
relatively low speed data streams through several steps. Then the data streams go to start pattern searching module 
to find the time of arrival and go to integration module for chip level energy integration. There are 32 fast integrators 
for integration, which are implemented by DSP cores embedded in Virtex-5 FPGA. The synchronization module 
combines the time of arrival signal and the integration results by finding the integrator with maximum energy, then 
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it gets the chip threshold. After that the receiver applies frame level synchronization to find the fine timing and make 
decisions. The finite state machine and variable thresholds control module coordinate all the processes. 


RX_A_PECL[7:0](X32) 

* =v> 

RX_P_PECL[7:0](X32) 

II 

64 samples s 2chips 


RX_PECL (15:01 (XttPy RX_d*U|63:0] -* data_0 data_1 .... data_31 data_32 _data 
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Module 
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0 10 10 


Figure 1.2: Receiver data flow 


1.1.2 Frame Structure 

The system is supposed to find the window with maximum energy in every 40 ns, which is the chip period. Fig. 
1.3 shows the new frame structure of the system. The first stage is to search the time of arrival; the second stage 
is to average 128 chips energy to find the chip threshold; the third stage is to synchronize the whole frame. Data 
demodulation is performed after the three-stage synchronization. Fig. 1.3 also shows the sliding window integration 
diagram and the state transition diagram. 

The modification in the synchronization head does not change the overall frame structure. The following parameters 
are kept: system’s frame length: 4096 chips or 163.84 /rs; payload efficiency: 94.49%; chip sync accuracy: one 
sample (1.25 ns); and frame sync accuracy: one chip (40 ns). If the overall clock accuracy is 20 PPM, then the 
maximum time drift is 163.84 x 1CT 6 = 3.276 [is. 


1.1.3 Dense Pulses 

However, even though we averaged the start pattern searching, wc observed that the synchronization error still 
mainly comes from the first stage. In this stage, the threshold is very sensitive to noise level and DC offset of ADC 
sampling, especially for longer distance transmission. So we try to send 5 chips long dense pulses at the beginning 
of each frame to make sure the time of arrival is found each frame since the average value of these chips should be 
much higher than ordinary ones. Fig. 1.4 shows the real picture of dense pulses at the beginning of each frame. 
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Find the window with maximum energy in 40 ns (Chip level) 
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Figure 1.3: The new frame structure design 
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Figure 1.4: The picture of dense pulse at the beginning of each frame 
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1.1.4 Start Pattern Averaging 

For the first synchronization stage, we accumulate the values of 32 consecutive samples and compare it with a 
threshold to claim whether it is one frame’s time of arrival. However, due to the interference and environment’s 
uncertainty, the first stage result should not be so reliable when it just counts on the value only one chip time, 
because the time frame is too short. So we try to average two chip time samples result to get the averaged value, and 
compare it with a threshold, Fig. 1.5 shows the diagram of this improvement, where we set a more strict condition to 
claim the time of arrival result by comparing 5 times(or other number) instead of only one, and is in 5 chips time(or 
other number), 3 high results are needed to claim the first stage synchronization result. After that, because we send 
dense pulses at the beginning of each frame and there is a interval between the pulses and the real chips, we delay 
two chips time before the beginning of second stage. 


1.1.5 Other modifications 

For the third stage,we also utilized 60 bits M sequence instead of 60 bits OOC code for frame synchronization. This 
is mainly because the randomness of M sequence is much better than OOC code. 

For time reversal system and non time reversal system, the received pulses are very different at the width, typically 
6 ns for time reversal and 15 ns for non time reversal. So in the energy integration module, the window size for 
integration should be different for different signal scenarios. This parameter is to be adjusted in integration module, 
as described in the last quarter report. 


1.2 Threshold Adjustment 


For energy detection based receiver, it is well known that the performance depends significantly on the choice 
of threshold level. A good threshold can be determined by using some channel quality indicator and feedback 
information provided by the digital processor (back-end) at the receiver. There is an optimum threshold for lowest 
error rate. To deal with ISI situation, it is proposed to set a threshold based on two worst signal cases. 

One phenomenon discovered is that a mono-bit ADC along with proper thresholding can be applied for quantizing 
the pre-filters coefficients, and the resulting signal still gets focused. It enables the development of a unique time 
reversal system with much lower complexity. Major points include: (1) optimization of the threshold for quantiza¬ 
tion; (2) development of an adaptive threshold method; (3) impact of tap spacing and length of the pre-filter. In our 
case, there are 4 variable thresholds at the receiver side with each corresponds to different stages of synchronization 
and detection. They are the threshold for start pattern searching, the threshold for ADC bias shift, the code distance 
for frame level synchronization and the threshold of chip threshold determine at the second synchronization stage. 
All these values are adjusted by the pushbuttons on the receiver Virtex-5 board and the led lights can show any 
threshold’s adjustment level. 

DC offset causes performance degradation in signal processing systems especially for high-speed applications. For 
our ADC board in the receiver side, the DC shift makes the true quantization value hard to determine, thus makes 
the threshold vulnerable, especially for the energy detection based receiver with OOK modulation. DC offset can be 
reduced in real-time by subtracting the mean amplitude from each sample, or by calibrating the ADC board. 
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Figure 1.5: Start pattern searching improvement 
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CHAPTER 2. FPGA IMPLEMENTATION 




2.1 FPGA Timing Closure Issues 


For FPGA implementation, meeting timing requirements in speed-critical designs has been always a challenge, and 
this is often an iterative process ensuring that each and every path in the design meets the required timing. Meeting 
timing closure is easy and automatic for relatively slow or small designs. However, most designs do not fall into 
that happy category and as each critical path is adjusted in order to meet timing, new ones are uncovered or created 
anew. In the case of FPGAs implemented at the 65nm technology node, for example, wire delays can account for 
80-to-90% of each path’s delay. 

An important technique for achieving successful timing closure on an aggressive design is to carefully review the 
most critical timing constraints. In our case, there are two critical paths at the transmitter: one is from the data 
loading module to the waveform generator module where huge parallel data steams exist, another critical path is 
in the parallel to serial conversion module requiring high clock rate. At the receiver side, the most critical path 
is from the integration module to the synchronization module which performs fast timing acquisition in parallel 
manner. Achieving clock-to-clock (global) timing for all internal signals in a synchronous design may be easy, 
but this simple approach will usually overconstrain the design and eventually lead to a failure. Therefore, we have 
applied different constraints to different paths depending on their timing requirements. 

The impact of timing constraints can be seen in Fig. 2.1, where the left picture shows the output waveform from 
the DAC when appropriate timing constraints are applied, while the right one shows the waveform with unwanted 
spikes when timing constraints are not properly added. 



Figure 2.1: Output waveforms comparison with and without applying proper timing constraints 
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2.2 Major FPGA Implementation Reports 


Table 2.1 shows the resource usage for the test-bed back-ends implemented on Xilinx Virtex-5 LXT FPGAs. Note 
that the FPGA chips used at the transmitter and receiver are of the same model, but the transmitter FPGA chip has 
1136 I/O ports while the receiver FPGA chip has only 665 I/O ports. As we can see, at both the transmitter and 
receiver there are a lot of FPGA resources available. The routing result is shown in Fig, 2.2, and the resource usages 
are visually displayed. 




Transmitter 


Receiver 


Figure 2.2: FPGA routing results for both transmitter and receiver 
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CHAPTER 2. FPGA IMPLEMENTATION 


Table 2.1: FPGA Implementation statistics at transmitter and receiver 


Resources 

Transmitter 
Amount used 

Transmitter 
Percent used 

Receiver 
Amount used 

Receiver 
Percent used 

Number of Slice Registers 

5314 

18% 

4184 

14% 

Number of Slice LUTs 

5019 

13% 

3088 

10% 

Number of occupied Slices 

1719 

24% 

1713 

23% 

Number of bonded IOBs 

40 

8% 

90 

25% 

Number of BlockRAM/FIFO 

2 

3% 

1 

1% 

Number of BUFG/BUFGCTRLs 

7 

21% 

10 

31% 

Number of DSP48Es 

NA 

NA 

32 

66% 

Total equivalent gate count 

275,214 

NA 

195438 

NA 

















Chapter 3 


System Test 

3.1 DAC Interpolating Filter Measurement 


The equivalent baseband filter in the transmitter-receiver chain has to be taken into account in order to have optimum 
waveform at the receiver detector. This filter is generally not flat over the frequency band of interest, especially for 
wide bandwidth case. Estimating and controlling the equivalent baseband filter is not easy. As shown in Fig. 3.1, the 
overall filter effect is contributed from three sources: the DAC’s interpolating filter p(t) and RF front-ends including 
antennas at both sides. By omitting some scalar factor, the equivalent filter in baseband format is 

F(t) = F 0 (t) ® Fi(t) ® F 2 (t), (3.1) 

and Fo(t) can be expressed using the interpolating filter p(t) 

Fo(t) = p{t) 4- j ■ p(t). (3.2) 

According to our measurement, the shape of the equivalent filter is dominated by the interpolating filter p(t). Based 
on the ADC used in the testbed, p(t) has been accurately measured and Fo(t) = p(t) + j * p(t) has been used as an 
estimate of F(t). 

To measure the filter, a full scale (14-bit) 1-ns wide digital pulse directly generated in the FPGA chip is used as a 
trigger source, resulting in a analog pulse at the DAC’s output. This pulse, as shown in Fig. 3.2, is actually the 
interpolating pulse shape (filter) corresponding to sampling rate of 1 Gs/s. The pulse width is over 1 ns and there is 
a tail behind the pulse, implying imperfect circuit coupling. The spectrum of the interpolating filter is depicted in 
Fig. 3.3. 
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CHAPTER 3. SYSTEM TEST 



Transmitter 



i_i 

Receiver 

Figure 3.1: Equivalent filter in transmitter-receiver chain (not including the propagation channel), contributed from three 
sources. 



Figure 3.2: Measured interpolating pulse shape. 
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Figure 3.3: The spectrum of the interpolating filter. 
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CHAPTER 3. SYSTEM TEST 


Transmitter 


TX 


Confined Metal Box 


RX 


Receiver 


Furnitures, equipments,... 


Office 


Figure 3.4: Layout for measurement in metal box. 


3.2 Performance Measurement 


The system was tested in a typical officc/lab, non-line-of-sight (NLOS) environment. The layout for office/lab 
environment is illustrated in Fig. 3.4. The transmitter and the receiver are set in one room, blocked by a confined 
metallic box. The height of both antennas is about 1.5 m. Separation distance between the two antennas is not a 
good metric since the path loss and the multipath profile do not solely depend upon the distance. The path loss 
in this scenario is over 70 dB. Two 30 dB low-noise-amplifiers (LNA) are used to compensate the path loss at the 
transmitter and receiver, respectively. 

We first use vector network analyzer (VNA) to measure the channel and get the Time Reversal I/Q waveforms via 
the method mentioned in the January report. Fig. 3.5 shows channel transfer function (CTF) and Fig. 3.6 shows 
the I/Q modulation waveforms. Before loading the trimmed Time Reversal waveforms into FPGA, we load the 
full-length waveforms to the DAC’s waveform memory to verify the validity of Time Reversal. If a focused signal 
can be observed at the receiver, then we say the waveforms are correct. It can be easily observed that only the Time 
Reversal one is focused in time. 

Trimmed waveforms are loaded in FPGA to conduct the system trial. Receiver sensitivity is measured in this trial. 
We reduce the transmit power until the receiver can not detect any signal. Based on this criterion, it is found that 
with insignificant ISI, the receiver sensitivity (measured in average receive power) for the time reversal is around -81 
dBm. The transmit spectra of Time Reversal system and impulse system are depicted in Fig. 3.7 and their powers are 
6.1 dBm and 8.9 dBm respectively. The receive spectra of Time Reversal system and impulse system are depicted in 
Fig. 3.8 and both of the powers are -81 dBm. As a result, Time Reversal system has a 2.8 dB gain from the channel 
over impulse system. 
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Figure 3.5: Channel transfer function, measured data from VNA. 
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Figure 3.6: Quantized I/Q modulation waveforms for DAC. Top is the I channel waveform and bottom is the Q 
channel waveform. 
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Figure 3.7: Transmit speetra for Time Reversal system and impulse radio system. 
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Figure 3.8: Reeeive speetra for Time Reversal system and impulse radio system. 


















































Chapter 4 


Current and Near-Future Work 


Although a gain contributed by time reversal precoding has been achieved, we believe there are still some room to 
improve the hardware implementation and the performance. Moreover, we have started to design a new test-bed 
with compressive sensing capability. 


4.1 Synchronization Optimization 


The proposed 3-stage synchronization scheme follows proper design philosophy for burst mode communication. We 
have found that the synchronization performance varies depending on the system configuration. There are a number 
of parameters that may need to be optimized. However, globally optimizing the overall synchronization performance 
is too challenging. A practical optimization strategy is to combine pure analysis, measurement and simulation. A 
detailed methodology needs to be investigated before starting the optimization process. 


4.2 Newer DAC and ADC 


The DAC (MAX 108) used in the receiver is not ideal in terms of sampling rate and connection. We are planning to 
replace it with some newer product to reach a sampling rate of 2 Gs/s or above. Current the sampling rate is only 
800 Ms/s, which might cause performance degradation since the time reversal pulse width at the detector’s output 
is only 6 ns at the half height. If the sampling rate increases to 2 Gs/s, 12 samples can be effectively captured for 
each received pulse. The current connection between the ADC and the FPGA board is loose and the individual path 
lengths for different bits of a sample are not accurately aligned mainly due to physical PCB routing limitation in the 
cable-connector adapting PCB. 

A high speed ribbon cable/bus has been successfully used to connect the DAC and the FPGA board in the transmitter. 
However, It is doubtable for this type of cables to meet our higher sampling rate requirement. One solution to the 
bottleneck caused by connection is to use a signal processing package with an ADC (or a DAC) and an FPGA chip 
integrated on a single board. We are searching such type of boards and evaluating their performance to price ratios. 
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4.3 Non-coherent Threshold-less Detection 


OOK modulation has been adopted in the system mainly for its simplicity for non-coherent detection. The number 
one bothering problem in the system with OOK modulation and non-coherent detection is the sensitivity to the 
decision threshold. An optimum threshold depends on the SNR at the receiver, and it requires sophisticated signal 
processing to achieve the optimum, which prevents the performance from being the optimum practically. 

Per above discussion, non-coherent threshold-less detection is extremely desired. One solution is a combination of 
PPM modulation and non-coherent detection. Shown in Fig.4.1 is a conceptual receiver block diagram for PPM 
modulation with non-coherent detection. 



SI-SO 

Figure 4.1: Conceptual receiver block diagram for PPM modulation with non-coherent detection 


4.4 2-by-l MISO 


A 3-dB array gain can be anticipated if the number of transmitter branches increases from one to two. For the 
passband signaling approach, the carriers in the two branches must be strictly aligned, which requires identical 
DACs, modulators and power amplifiers in the both branches. The two branches can share a common local oscillator 
(LO). We are currently searching devices for this matter. 


4.5 Direct-Conversion Receiver 


To explore more communication techniques, especially those implemented at receiver side, a coherent receiver 
will provide more potential than a non-coherent receiver. Among all coherent receiver architectures is a direct- 
conversion receiver which is popular nowadays. This type of receiver is also called homodyne, synchrodyne or 
zero-IF receiver. An attractive feature of a direct-conversion receiver is that it requires less analog processing but 
more digital processing. This feature favors our design philosophy in terms of flexibility, thus it has been considered 
as our new receiver. In UWB band, a modulation-demodulation scheme to test on this receiver might be transmitted 
reference or its variants. 
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4.6 Real-Time Error Rate Measurement 


So far we have tested frame loss rate (due to synchronization failure) and bit error rate in a non-real-time way. A 
real-time error meter will be extremely desired since it can serve as an indicator in a system trial and make the 
measurement more efficient and effective. A stand-alone real-time error meter mainly consists of a data acquisition 
section as well as a data processing section, and it can be virtually used in different situations. A difficult part of a 
stand-alone error meter is to synchronize the meter with the incoming data stream. 
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4.7 Implementation of a Compressed Sensing Based Communication System 


We propose an implementable Compressed Sensing (CS) based communication system, as illustrated in Fig. 4.2. 
A 1 Ghz UWB signal is transmitted in the 10 GHz carrier and captured by a 25 Msps A/D. This system utilizes 
the concept in the October 2008’s report but with smaller bandwidth. This is due to the limitation of available RF 
hardware. The baseband signal must be modulated to carrier frequency because no antennas can transmit signal 
with frequency starting from DC. Sparse pulse position modulation (PPM) data is digitally filtered by a high rate PN 
filter in FPGA. For a Xilinx Virtex 5 FPGA, the rate of output signal can be up to 3.2 Gsps thanks to the advanced 
RocketlO technology. The circuity in FPGA acts as a virtual interpolating filter, transforming digital signal into 
analog waveform. Alternatively, we can build a PN sequence generator with rate up to even 80 Gsps by standard 
high rate shift register. However, this is not implementable due to the limitation of the state-of-art RF technology. To 
the best knowledge of the author, the largest bandwidth supported by a modulator is 1 GHz, provided by MITEQs 
SDM1015L13CDQ with 10 GHz - 15 GHz RF output. As a result, the output signal bandwidth from FPGA should 
not exceed 1 GHz. Power amplifier (PA) is used before transmitting the UWB signal over the air. The structure of 
the receiver is very simple. There is no down-conversion. A 25 Msps A/D samples the data directly after a low noise 
amplifier (LNA) and passband filter. FPGA is used to acquire the A/D data and process with recovery algorithms 
such as OMP, StOMP, etc.. 

The following is a list of key hardware found in the market in Fig. 4.2: 


• FPGA: Xilinx Virtex 5 LX330T with 3.25 Gsps RocketlO 

• Modulator: MITEQ SDM1015L13CDQ with 1 GHz modulation bandwidth, 10 GHz - 15 Ghz RF output 
frequency 

• Local Oscillator: MITEQ LPLM15000 with 8 GHz - 15 GHz output frequency 

• PA: MITEQ AMF-6B-08001800-60-34P with 8 GHz - 18 GHz frequency range, 37 dB gain 

• LNA: MITEQ AMFW-5S-10001500-250 with 10 GHz - 15 GHz frequency range, 30 dB gain 



Figure 4.2: Architecture of a proposed compressed sensing based communication system. 
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5.1 Introduction 


Waveform design or optimization is a key research issue in the current wireless communication system. Waveform 
should be designed according to the different requirements and objectives of system performance. For example, the 
waveform should be designed to carry more information to the receiver in terms of capacity. For navigation and 
geolocation, the ultra short waveform should be used to increase the resolution. If the energy detector is employed 
at the receiver, the waveform should be optimized such that the energy of the signal within integration window at 
the receiver should be maximized. In the context of cognitive radio, waveform design or optimization gives us more 
flexibility to design a radio which can coexist with other cognitive radios and primary radios. From cognitive radio's 
point of view, spectral mask constraint at the transmitter and the influence of Arbitrary Notch Filter at the receiver 
should be seriously considered for waveform design or optimization, except the consideration of the traditional com¬ 
munication objectives. Spectral mask constraint is imposed on the transmitted waveform such that cognitive radio 
has no interference to primary radio. While Arbitrary Notch Filter at the receiver is used to cancel the interference 
from primary radio to cognitive radio. Though Arbitrary Notch Filter is at the receiver, the influence of Arbitrary 
Notch Filter should be incorporated into waveform design at the transmitter. How to implement Arbitrary Notch 
Filter is out of scope of this chapter. 

The optimization problem is formulated as Quadratically Constrained Quadratic Program (QCQP), which is known 
as an NP-hard problem. However, after relaxation, QCQP can be solved by using Semidefinite Program (SDP) plus 
a randomized algorithm. In addition to SDP, a computationally-efficient iterative method is proposed to give the 
optimal solution for a special situation where the integrator only captures one sample after ADC as the decision 
statistic. 


5.2 System Description and Optimal Waveform 


The system architecture is shown in Figure 5.1. We limit our discussion to a single-user scenario, and consider the 
transmitted signal with OOK modulation given by 

oc 

s{t)= djp(t — jTb) (5.1) 

j=— OO 


where Tj, is the symbol duration, p(t) is the transmitted symbol waveform defined over '0, T p \ and dj € {0,1} is the 
j-th transmitted bit. Without loss of generality, assume the minimal propagation delay is equal to zero. The energy 
of p(t) is E p , 

T 

p 2 (t)dt = E p (5.2) 

and the frequency domain representation of p(t) is p/(f). Thus, 




(5.3) 


The received noise-polluted signal at the output of low noise amplifier (LNA) is 

r(t.) — h (t) <g) s (t) 4- n (t) (5.4) 

OO 

= ^2 d 3 x (t - i T b) + n ( t ) 

j=~ OO 


(5.5) 
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Figure 5.1: System architecture. 

where h(t) ,t £ [0, Th] is the multipath impulse response that takes into account the effect of channel impulse 
response, the RF front-ends in the transceivers such as Power Amplifier, LNA and Arbitrary Notch Filter as well as 
antennas. h(t) is available at the transmitter [1] [2] and the energy of h(t) is Eh, 



dt = Eh 


and the frequency domain representation of h(t) is hf(f). Thus, 


/ oo 

\hf (f)\ 2 df = E f 

-oo 


(5.6) 


(5.7) 


44 ®” denotes convolution operation. n(t) is a low-pass additive zero-mean Gaussian noise with one-sided bandwidth 
W and one-sided power spectral density Nq. x(t) is the received noiseless symbol- 44 !” waveform defined as 


x(t) = h(t) ®p(t) 


(5.8) 


We further assume that Xj, > Th 4 T p = X x , i.e. no existence of ISI. 

An energy detector performs square operation to r(t). Then the integrator does the integration over a given integra¬ 
tion window Tj. Corresponding to the time index k , the k- th decision statistic at the output of the integrator is given 
by 


z k = 


L 

L 


kTb+Tjo+T] 


kTb+Tj o 
^Tfe-HT/o+T’/ 


kTb+Tj o 


r 2 (t)dt 

(dkx(t - kTfy) + n(t)) 2 dt 


(5.9) 

(5.10) 


where Tj o is the starting time of integration for each symbol and 0 < T/o < T/o 4 Tj < T x < T^. 

An approximately equivalent SNR for the energy detector receiver, which provides the same detection performance 
when applied to a coherent receiver, is given as [3] 


SNReq = 


2(jJ' o ° +T 'z 2 

2.3 TiWN% + No x 2 ( t ) dt 


(5.11) 
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For best performance, the equivalent SNR SNRe q should be maximized. Define, 



rTi\ 

Jtj o 


Tiq+Ti 


x 1 (t) dt 


(5.12) 


For given T/, Nq and W, SNReq is the increasing function of Ej. So the maximization of SNReq in Equation 5.11 
is equivalent to the maximization of Ej in Equation 5.12. 


So the optimization problem to get the optimal p(t ) is shown below, 

max % 2 {t)dt 

s.t. Jq P p 2 (t) dt = E p 


(5.13) 


In order to solve the optimization problem 5.13, a numerical approach is employed in this document. In other words, 
p(t), h(t) and x(t) are uniformly sampled (assumed at Nyquist rate), and the optimization problem 5.13 will be 
converted to its corresponding diserete-time form. Assume the sampling period is T s . T p /T s = N p and N p is 
assumed to be even, T\ l fT s — and T x /T s = N x . So N x = N p + Nh. 

p(t ), h(t) and x(t) are represented by pi, i = 0,1,..., N p , h^i = 0,1,...,and x*, i = 0,1,..., N x respec¬ 
tively [3]. 

Define, 

P = [po Pi ■ ■ ■ PN p ] T (5.14) 

and 

X = [xo X\ ■■■ XN x } T (5.15) 


Construct channel matrix H^+i^Ap+t). 


( 


H) - = i 

\ 0, 


-j,0<i-j<N h 

else 


(5.16) 


where («) z ■ denotes the entry in the i-th row and j-th eolumn of the matrix or veetor. Meanwhile, for veetor, taking 
p as an example, (p) z x is equivalent to p z ~\. 


The matrix expression of Equation 5.8 is, 


x = Hp 


(5.17) 


and the constraint in the optimization problem 5.13 can be expressed as, 


2 T s = E p 


(5.18) 


where “I • 9 ” denotes the norm-2 of the vector. In order to make the whole document consistent, we further assume, 


= 1 


(5.19) 


Let Tj/T s = Nj and Tjq/T 9 = Nj q. The entries in x within integration window constitute X/ as, 

x/ = [a^Y/o 3-iV/o+i * * * ^TV/o+yv/] 


(5.20) 
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and Ej in Equation 5.12 can be equivalently shown as, 




Simply dropping T s in Ej will not affect the optimization objective, so Ej is redefined as, 

E, = ||xy||| 


Similar to Equation 5.17, x/ can be obtained by, 

x/ = H/p 

where (H i) t - = (H)jv /0 +i,j and z = 1,2,..., Nj + 1 as well as j = 1,2,..., N p + 1. 

The optimization problem 5.13 can be represented by its discrete-time form as, 

max Ej 

s.t. ||p||^ = 1 


(5.21) 

(5.22) 


(5.23) 


(5.24) 


The optimal solution p* for the optimization problem 5.24 is the dominant eigen-vector in the following eigen¬ 
function [3], 

HjH/p = Ap (5.25) 

Furthermore, E*j will be obtained by Equation 5.22 and Equation 5.23. 


5.3 Wideband Waveform Design in Cognitive Radio 


For cognitive radio, there is a spectral mask constraint for the transmitted waveform. Based on the previous discus¬ 
sion, p is assumed to be the transmitted waveform and F is the Fourier transform matrix, thus the frequency domain 
representation of p is, 

P / = Fp (5.26) 

where py is a complex value vector. If the z-th row of F is f 2 , then each complex value in py can be represented by, 



Thus the modulus of the complex value is 



(5.27) 


Define, 



(5.28) 


Given the spectral mask constraint in terms of power spectral density c = 


(p/)i,i = f *p 


= p 7 

= p t f,p 

< Ci,i = 1,2, 


T 


C\ 


C ^ + l 
2 T 


, SO 


Nr 


+ 1 


(5.29) 

(5.30) 

(5.31) 

(5.32) 
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So the optimization problem with spectral mask constraint can be expressed as, 

max £/ 


< 1 
2 — 1 


s.t. 


P Fip < Cl 


P T Fnj, p<cn£ 


(5.33) 


Define, 


Ho = HfH; 


(5.34) 


Equation 5.22, Equation 5.23 and Equation 5.34 are put into the optimization problem 5.33, and then this optimiza¬ 
tion problem will be formulated as, 

maxp 7 H 0 p 


s.t. 


p ; p < i 

P T Fip < cj 


P 7 F^ + 1 P <^ +1 


(5.35) 


In the optimization problem 5.35, both the objective function and the constraints are quadratic functions, so this kind 
of optimization problem is a QCQP. The general QCQP is a NP-hard problem. A semidefinite relaxation method is 
used in this document to give the suboptimal solution to the optimization problem 5.35. 

Define, 

P = pp 7 (5.36) 

So P should be the symmetric positive semidefinite matrix, i.e. P >-= 0 and rank of P should be equal to 1. 


Then, 


Ei = 


P 7 H 0 p 

(5.37) 

trace (Hopp^ ) 

(5.38) 

trace (H 0 P) 

(5.39) 


N 


and p 2 = trace (P), p 7 F z p = trace (F*P), i = 1,2,..., ^ + 1. 

Rank constraint is nonconvex constraint, so after dropping it, QCQP is relaxed to SDP, 

max trace (H 0 P) 

trace (P) < 1 
trace (FiP) < Ci 
s.t . 


(5.40) 


trace n p | t P^ < 


Cjv 


? + l 


The optimal solution P* of the optimization problem 5.40 can be obtained by using CVX tool [4] and the value of 
the objective function gives the upper bound of the optimal value in the optimization problem 5.35. If the rank of 
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P* is equal to 1, then the dominant eigen-vector of P* will be the optimal solution p* for the optimization problem 
5.35. But if the rank of P* is not equal to 1, the dominant eigen-vector of P* can not be treated as the optimal 
solution for the optimization problem 5.35, because of the violation of bound constraint. 

An efficient randomized algorithm is given by [5] [6] to obtain the feasible solution p* from P*. 

This algorithm will be repeated a sufficient number of times to get the relatively good feasible p* as the suboptimal 
solution to the optimization problem 5.35. 

When T[ approaches zero, which means the integrator only captures one sample after ADC as the decision statistic, 
the maximization of E\ will be equivalent to the maximization of x (T/o). An iterative algorithm is proposed here 
to give the solution to the maximization of x (T/o), which will be more computationally efficient than the previous 
semidefinite relaxation method. For the simplicity of the following presentation, T/o is assumed to be zero. 


From the inverse Fourier transform, 


/ oo 

x } (/) e^df 

-OO 


and from Equation 5.8, 

x /{f) = M f)pj(f) 

where xj (/) is the the frequency domain representation of x(t) respectively. 
So, 


x 


/ oo 

x f (/) df 

-oo 

-L 


hf (/) Pf (/) df 


If there is no spectral mask constraint, then according to the Cauchy—Schwarz inequality, 


(5.41) 

(5.42) 


(5.43) 

(5.44) 


roo 

x(0) = / h f (f)pf (f)df 

J —OO 

(5.45) 

/ roo roo 

< J \h/(f)\ 2 df \ Pf (f)\ 2 df 

V J —oo J —oo 

(5.46) 

f 

~ \J Eh Ep 

(5.47) 


when pf (/) = ahj (/) for all /, the equality is obtained. In this case, p(t) = ah(— t), which means the optimal 
waveform is the time reversed multipath impulse response. And, 


a = 


(5.48) 


If there is a spectral mask constraint, then the following optimization problem will become more complicated. 

max x (0) 

s-t- J^lPf {f)\ 2 df < 1 
Pf (/)| 2 < c f (/) 


(5.49) 
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where cj (/) represents the arbitrary spectral mask constraint. 

Because p/(f) is the complex value, the phase and the modulus of p/(f) should be determined. 


/ oo 

hf ( f)p f (/) df 

-oo 


-oo 

‘OO 


/ oo 

\hf (/)I e^arg(*,(/>) | p/ (/) | e j2^arg( P /(/)) rf/ 

-oo 


-oo 

•oo 


/ oo 

|/i, (/)| | P/ (/)| 

-00 


(5.50) 


(5.51) 


(5.52) 


where the angular component of the complex value is arg | •). 


For the real value signal x(t), 


where denotes conjugate operation. 


X/ (/) =*}(“/) 


(5.53) 


x/ (/) = |/iy (/)| | P/ (/)| e ^( ar g(' ! /^) +ar s(P/^))) 

X f (-/) = |/ty (/)| | P/ (/)| e -^( ar 8('*/(/))+ a rg(P/(/))) 
and x/(/) + Xf(—f) is equal to 

l /l / (/)l IP/ (/)| cos(2tt (arg ( h f (/)) + arg ( p f (/)))) 

If h(f) and |p/ (/)| is given, maximization of x (0) is equivalent to setting, 

arg(/i(/)) + arg(p(/)) = 0 

which means the angular component of pj (/) is the negative angular component of hj (/), 
The optimization problem 5.49 can be simplified as, 

s - t I- 00 \Pf (/)l 2 # < i 

Ip/(/)| 2 < C/ (/),/> o 


(5.54) 

(5.55) 


(5.56) 


(5.57) 


(5.58) 


Because, 


I hf (f)\ = \h f (-f)\ 

\Pf(f)\ = \Pf(~f)\ 
I Cf (/)l = |c/(-/)| 


(5.59) 

(5.60) 

(5.61) 


for all /. Thus uniformly discrete frequency points /o, ..., /a/ are considered in the optimization problem 5.58. 
Meanwhile, /o corresponds to the DC component and f \,..., /a/ correspond to the positive frequency component. 


Define vector hy, 


(h ) =1 M/i-l)l>* = 1 . 

f 1,1 1 'n/2 \hf (/t-i)|, i = 2,..., 4-1 


(5.62) 






5.3. WIDEBAND WAVEFORM DESIGN IN COGNITIVE RADIO 


33 


Define vector pf. 


/_ x J |P/(/<-i)h* = l 

^ 1,1 * \/2 |p/ (/t-i)|, i = 2,..., ^ + 1 


(5.63) 


Define vector cy 


/ v _ j V c f 1 

1 /; *' 1 1 N /2c / (/ i _ 1 ),j = 2,...,^ + l 


(5.64) 


Thus, the discrete version of the optimization problem 5.58 is shown below, 

max h /p/ 

s.t. Ilp/lll < 1 

Pf<Cf 


(5.65) 


So an iterative algorithm is proposed to get the optimal solution py to the optimization problem 5.65 as follows. 

1. Initialization: P = 1 and py is set to be an all-0 column vector. 

2. Solve the following optimization problem to get the optimal qy using Cauchy—Schwarz inequality. 


max h /q/ 
s.t. ||q/1 |2 < P 


(5.66) 


3. Find i, such that is the maximal value in the set < (qj) ^q}- ) > (c f) jtl 

^ J \ J ^ 

algorithm is terminated and p* f := py -f qy. Otherwise go to step 4. 


. If {z} = 0, then the 


4. Set (pj) j i = (€,)„,• 

5. P := P - (c/) i x and set (h/)^ x to zero. Go to step 2. 

When py is obtained for the optimization problem 5.65, from Equation 5.57 and Equation 5.63, the optimal p/(f) 
and the corresponding p(t) can be smoothly achieved. 

The proposed iterative algorithm gives the optimal solution to the optimization problem 5.65, which can be proofed 
as follows. 


Given column vector a, all the entries aj,..., in a are positive. Meanwhile, we assume ai > max {a 2 , 
We would like to solve the following optimization problem to get the optimal column vector b, 

max a 7 b 
s.t. b 7 b < P 
b\ < c 

< qj^/P 
c ^ N 

£ a? 

i—1 


■ ,o-n} 


(5.67) 
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For optimal solution b*, if b\ — c, then the optimum of the iterative algorithm is achieved. Set 61 = ac, where 
0 < a < 1 , then, 


N 


a r b = ^ a z b 2 


2=1 


N 


■— ci\b\ ~h / ^ 


2=2 


< aa\c + 


\ 


N 


af y/ P — q 2 c 2 


2=2 


(5.68) 

(5.69) 

(5.70) 


Let / (a) = aa\c + 


N 

£ a t VP — a 2 c 2 , the optimization problem is reduced to, 

i—2 


max / (a) 
s.t.O < a < 1 


(5.71) 


From / (a), we can get 




aic 


ac 



VP — a 2 c 2 


(5.72) 


Let (a) is equal to zero, then a is equal to ± ai ^ • Because of the constraint on c, \a\ is greater than 1. 

cEa? 

i=i 

Meanwhile, ^ (0) = a\c 9 which means ^ (a) keeps positive value in [0,1]. Thus / (a) is a monotone increasing 

function in [0,1 . arg max / (a) = 1. So, b\ = c. The optimum of the iterative algorithm is verified. 

<*€[ 0 , 1 ] 


5.4 Numerical Results 


Figure 5.2 shows the multipath impulse response h(t) under investigation in this document and the energy of h(t) 
is normalized. Figure 5.3 shows the frequency domain representation hf(f) of h(t). Green notch line in Figure 
5.3 shows the influence of Arbitrary Notch Filter at the receiver. T s = 0.5ns, Th = 100ns, T p = 100ns and 
77o + = 100ns. In the following presentation, the designed waveform, which is obtained by SDP and the rank of 

P* to the optimization problem 5.40 is not equal to 1, is called Waveform SDP Unverified; the designed waveform, 
which is obtained by SDP and the rank of P* to the optimization problem 5.40 is equal to 1 or which is obtained 
by SDP plus a randomized algorithm, is called Waveform SDP; the designed waveform, which is obtained by an 
iterative algorithm, is called Waveform by Iterative Algorithm. 

If T/ —> 0, which means the integrator only captures one sample after ADC as the decision statistic, then spectral 
mask, notch line of Arbitrary Notch Filter, the designed waveforms and multipath impulse response represented 
in the frequency domain are shown in Figure 5.4. Meanwhile, the designed waveforms in time domain are shown 
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Figure 5.2: Multhpath impulse response. 



Figure 5.3: Multipath impulse response and notch line of Arbitrary Notch Filter represented in the frequency domain. 

in Figure 5.5. Figure 5.6 shows the convolution between the designed waveforms and multipath channel impulse 
response. In this case spectral mask constraint is not tight. The peak value of noiseless signal in Figure 5.6 is I. 
If the arbitrary spectral mask constraint is imposed on the transmitted waveform, then spectral mask, notch line of 
Arbitrary Notch Filter, the designed waveforms and multipath impulse response represented in the frequency domain 
are shown in Figure 5.7. Meanwhile, the designed waveforms in time domain are shown in Figure 5.8. Figure 5.9 
shows the convolution between the designed waveforms and multipath channel impulse response. The peak value of 
the noiseless signal in Figure 5.9 is 0.94. This sharp peak can give us benefits for wireless communication. 

In sum, when Tj —> 0, the waveform by an iterative algorithm is the same as the waveform by SDP. SDP always gives 
the optimal solution to the optimization problem 5.35, which means the rank of P* to the optimization problem 5.40 
is always equal to 1. The reason of this result is that in this kind of situation the rank of H 0 is equal to 1. Meanwhile, 
the transmitted waveform satisfies the spectral mask constraint, which will make cognitive radio have no interference 
to primary radio. Besides, cognitive radio will transmit the waveform such that the energy will not be transmitted in 
the notched frequency band at the receiver in order to save the energy. Thus, the designed waveform can be treated 
as a wideband non-continuity signal. 

If Tj is equal to 20ns, then two different spectral masks are considered. One spectral mask and the corresponding 
notch line of Arbitrary Notch Filter, the designed waveforms and multipath impulse response represented in the 
frequency domain are shown in Figure 5.10. The other is shown in Figure 5.11. When Tj is larger than 0, only SDP 
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Figure 5.4: Spectral mask, notch line of Arbitrary Notch Filter, the designed waveforms and multipath impulse 
response represented in the frequency domain when Tj —► 0. 



Figure 5.5: Designed waveforms in the time domain when Tj —> 0. 

can not guarantee the optimal solution at all times because the rank of P* to the optimization problem 5.40 can not 
be always equal to 1. In the first case, only SDP gives the optimal solution. But in the second case, the waveform 
obtained only by SDP violates the spectral mask constraint in some frequency points, and SDP plus a randomized 
algorithm gives the suboptimal solution. 


5.5 Discussion 


This chapter deals with wideband waveform optimization with energy detector in cognitive radio. Wideband wave¬ 
form is designed according to the optimization objective with the considerations of the spectral mask constraint 
at the transmitter and the influence of Arbitrary Notch Filter at the receiver. The optimization problem is formu¬ 
lated as QCQP. After relaxation, QCQP can be solved by using SDP plus a randomized algorithm. Meanwhile, a 
computationally-efficient iterative method is proposed to give the optimal solution for a special situation where the 
integrator only captures one sample after ADC as the decision statistic. 
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Figure 5.6: Convolution between the designed waveforms and the multipath impulse response when Tj —> 0. 



Figure 5.7: Spectral mask, notch line of Arbitrary Notch Filter, the designed waveforms and multipath impulse 
response represented in the frequency domain when Tj —> 0. 



Figure 5.8: Designed waveforms in the time domain when Tj —► 0. 
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Figure 5.9: Convolution between the designed waveforms and the multipath impulse response when Tj 


0 . 



Figure 5.10: Spectral mask, notch line of Arbitrary Notch Filter, the designed waveforms and multipath impulse 
response represented in the frequency domain when Tj is 20ns. 
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Figure 5.11: Spectral mask, notch line of Arbitrary Notch Filter, the designed waveforms and multipath impulse 
response represented in the frequency domain when Tj is 20ns. 
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